Corpus: ind_web_2011_10K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams at sentence endings, for N = 1...5, in the first K sentences

K          # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100                91             99              99             99             99
1000              769            966             993            997            997
10000            4749           8478            9609           9863           9917
100000           4750           8479            9610           9864           9918
1000000          4750           8479            9610           9864           9918
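
A count of this kind could be produced roughly as in the sketch below. It is only an illustration, not the tool that generated this page: it assumes the figures are distinct N-gram types, that tokens are separated by whitespace, and that sentence-final punctuation has already been removed. The function names and the sample sentences are hypothetical.

```python
from typing import List, Tuple


def ending_ngram(sentence: str, n: int) -> Tuple[str, ...]:
    """Return the last n whitespace-separated tokens, or () if the sentence is shorter."""
    tokens = sentence.split()
    if len(tokens) < n:
        return ()
    return tuple(tokens[-n:])


def distinct_ending_ngrams(sentences: List[str], k: int, max_n: int = 5) -> List[int]:
    """Count distinct sentence-final N-grams (N = 1..max_n) in the first k sentences."""
    counts = []
    for n in range(1, max_n + 1):
        seen = set()
        # Slicing past the end of the list simply yields every sentence.
        for sentence in sentences[:k]:
            gram = ending_ngram(sentence, n)
            if gram:
                seen.add(gram)
        counts.append(len(seen))
    return counts


# Hypothetical sample sentences, not taken from the corpus.
sample = [
    "Saya pergi ke pasar",
    "Dia pergi ke pasar",
    "Kami makan nasi goreng",
]
print(distinct_ending_ngrams(sample, k=3))  # [2, 2, 2, 3, 0]
```

Because `sentences[:k]` stops at the end of the list, any K larger than the number of available sentences gives the same counts, which would be consistent with the identical rows for K = 100000 and K = 1000000 if the corpus holds fewer sentences than that.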


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram]
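
The data behind a Zipf diagram of this kind is a list of (rank, frequency) pairs. The sketch below shows one way such pairs might be computed for sentence-final words. It assumes whitespace tokenization and is not the script used for the diagram here; `zipf_points` and the sample data are hypothetical.

```python
from collections import Counter
from typing import List, Tuple


def zipf_points(sentences: List[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final words, rank 1 being the most frequent."""
    last_words = [s.split()[-1] for s in sentences if s.split()]
    freqs = sorted(Counter(last_words).values(), reverse=True)
    return list(enumerate(freqs, start=1))


# Hypothetical sample: "c" ends three sentences, "j" two, "o" one.
sample = ["a b c", "d e c", "f g c", "h i j", "k l j", "m n o"]
for rank, freq in zipf_points(sample):
    print(rank, freq)
```

Written to a file, one pair per line, these values can be plotted in Gnuplot on logarithmic axes (`set logscale xy`), where a Zipf-like distribution appears as an approximately straight line.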

895 msec needed at 2018-04-30 00:23